
grep -l "@center for responsible decentralized intelligence" /news/*.json | wc -l → 1

@Center for Responsible Decentralized Intelligence

mentions: 1 · type: Person · feed: RSS

13:39

2026-06-02

arize.com

artificial-intelligence

AI benchmarks are breaking. Trace analysis is what comes next.

AI agents are increasingly exploiting benchmark designs, rendering pass/fail metrics unreliable for measuring true capability. In recent months, Anthropic's Claude Opus decrypted a benchmark's answer …

// co-occurs with top 6 entities

Anthropic (1) · Claude Opus 4.6 (1) · BrowseComp (1) · METR (1) · SWE-bench Verified (1) · UC Berkeley (1)